Feature flow: In-network feature flow estimation for video object detection
نویسندگان
چکیده
• A type of shallow modules are proposed to directly predict the feature flow for alignment in a single network. Self-supervision learning is introduced further improve quality predicted flow. new state-of-the-art performance shown by comparing with other methods, while fast inference speed maintained. Optical flow, which expresses pixel displacement, widely used many computer vision tasks provide pixel-level motion information. However, remarkable progress convolutional neural network, recent approaches solve problems on feature-level. Since displacement vector not consistent common approach forward optical network and fine-tune this task dataset. With method, they expect fine-tuned produce tensors encoding feature-level In paper, we rethink about de facto paradigm analyze its drawbacks video object detection task. To mitigate these issues, propose novel (IFF-Net) an I n-network F eature low estimation module (IFF module) detection. Without resorting pre-training any additional dataset, our IFF able indicates displacement. Our consists module, shares features branches. This compact design enables IFF-Net accurately detect objects, maintaining speed. Furthermore, transformation residual loss (TRL) based self-supervision , improves IFF-Net. outperforms existing methods achieves ImageNet VID.
منابع مشابه
Robust Optical Flow Estimation Using Invariant Feature
Traditional methods for computing optical flow are mainly based on image brightness constancy. In the real world the brightness constancy usually does not hold. Here we present the idea of using invariant feature based on the brightness change model to estimate the optical flow. Both the mathematical derivation and the experiments show that the new model is better than brightness based optical ...
متن کاملMotion Feature Detection Using Steerable Flow Fields
The estimation and detection of occlusion boundaries and moving bars are important and challenging problems in image sequence analysis. Here, we model such motion features as linear combinations of steerable basis flow fields. These models constrain the interpretation of image motion, and are used in the same way as translational or affine motion models. We estimate the subspace coefficients of...
متن کاملFisher Discriminant Analysis (FDA), a supervised feature reduction method in seismic object detection
Automatic processes on seismic data using pattern recognition is one of the interesting fields in geophysical data interpretation. One part is the seismic object detection using different supervised classification methods that finally has an output as a probability cube. Object detection process starts with generating a pickset of two classes labeled as object and non-object and then selecting ...
متن کاملDeep Spatial-Temporal Joint Feature Representation for Video Object Detection
With the development of deep neural networks, many object detection frameworks have shown great success in the fields of smart surveillance, self-driving cars, and facial recognition. However, the data sources are usually videos, and the object detection frameworks are mostly established on still images and only use the spatial information, which means that the feature consistency cannot be ens...
متن کاملFeature-Level based Video Fusion for Object Detection
Fusion of three-dimensional data from multiple sensors gained momentum, especially in applications pertaining to surveillance, when promising results were obtained in moving object detection. Several approaches to video fusion of visual and infrared data have been proposed in recent literature. They mainly comprise of pixel based methodologies. Surveillance is a major application of video fusio...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Pattern Recognition
سال: 2022
ISSN: ['1873-5142', '0031-3203']
DOI: https://doi.org/10.1016/j.patcog.2021.108323